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ABSTRACT 

We examine how lensing tomography with the bispectrum and power spectrum can constrain 
cosmological parameters and the equation of state of dark energy. Our analysis uses the full 
information at the two- and three-point level from angular scales of a few degrees to 5 ar- 
cminutes (50 < I < 3000), which will be probed by lensing surveys. We use all triangle 
configurations, cross-power spectra and bispectra constructed from up to three redshift bins 
with photometric redshifts, and all relevant covariances in our analysis. 

We find that the parameter constraints from bispectrum tomography are comparable to 
those from power spectrum tomography. Combining the two improves parameter accura- 
cies by a factor of 3 due to their complementarity. For the dark energy parameterization 

1/2 

w(a) = wo + w a (1 — a), the marginalized errors from lensing alone are a(wo) ~ 0.03/ sky 

— 1/2 

and (j(w a ) ~ 0.1 / sky . We show that these constraints can be further improved when com- 
bined with measurements of the cosmic microwave background or Type la supernovae. The 
amplitude and shape of the mass power spectrum are also shown to be precisely constrained. 
We use hyper-extended perturbation theory to compute the nonlinear lensing bispectrum for 
dark energy models. Accurate model predictions of the bispectrum in the moderately non- 
linear regime, calibrated with numerical simulations, will be needed to realize the parameter 
accuracy we have estimated. Finally, we estimate how well the lensing bispectrum can con- 
strain a model with primordial non-Gaussianity. 

Key words: cosmology: theory — dark energy — gravitational lensing — large-scale struc- 
ture of universe 



1 INTRODUCTION 

Various cosmological probes have given strong evidence that a dark energy component, such as the cosmological constant, constitutes 
approximately 70% of the total energy density of the universe. The most striking evidence comes from observations of supernovae in distant 
galaxies (Riess et al. 1998; Perlmutter et al. 1999, 2000), mapping temperature anisotropies in the cosmic microwave background (CMB) sky 
(e.g., Spergel et al. 2003) and detections of the integrated Sachs-Wolfe effect via the cross-correlation between the CMB and the large-scale 
structure (Boughn & Crittenden 2003; Nolta et al. 2003; Scranton et al. 2003). If the dark energy is constant in time, its natural value is 50 
to 120 orders of magnitudes larger than the observed value (Weinberg 1989). A model that allows the dark energy to dynamically evolve 
could avoid this fine tuning problem (see Ratra & Peebles 2003 for a review). Such a dark energy model can be characterized by a time- 
dependent equation of state w(a) = pde/pde with w < (the cosmological constant corresponds to w — —1). It is useful to treat this as a 
parameterization to be determined empirically, due to the lack of compelling models for the dark energy (e.g. Turner & White 1997). 

The dynamically evolving dark energy affects the expansion rate of the universe, which in turn alters the redshift evolution of mass 
clustering. It is well established that weak lensing can directly map the mass distribution along the line of sight by measuring the correlated 
distortion of images of distant galaxies (see Mellier 1999; Bartelmann & Schneider 2001 for review; and also see, e.g., Hamana et al. 2003, 
Jarvis et al. 2003, Van Waerbeke & Mellier 2003 and references therein for the state of observations). Future wide-field multi-color surveys 
are expected to have photometric redshift information for distant galaxies beyond z = 1 (see e.g. Massey et al. 2003). This additional 
information is extremely valuable in that it allows us to recover radial information on the lensing field, which probes the redshift evolution of 
the expansion history and mass clustering. Since the weak lensing observables are predictable ab initio given a cosmological model, lensing 
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tomography is a well-grounded means of constraining dark energy evolution (Hu 1999, 2002a,b; Huterer 2002; Futamase & Yoshida 2001; 
Abazajian & Dodelson 2003; Heavens 2003; Refregier et al. 2003; Benabed & Van Waerbeke 2003; Jain & Taylor 2003; Bernstein & Jain 
2003; Simon et al. 2003). While luminosity distance measures of supernovae are an established direct probe of dark energy (e.g., Chiba & 
Nakamura 2000; Huterer & Turner 2001; Tegmark 2002; Frieman et al. 2003; Linder 2003), it is important to perform cross checks of various 
methods to understand systematics and because they have sensitivities to different redshift ranges. 

Non-linear gravitational clustering induces non-Gaussianity in the weak lensing fields, even if the primordial fluctuations are Gaussian. 
This non-Gaussian signal thus provides additional information on structure formation models that cannot be extracted by the widely used 
two-point statistics such as the power spectrum. The bispectrum, the Fourier counterpart of the three-point correlation function, is the lowest- 
order statistical quantity to describe non-Gaussianity. Future wide-field surveys promise to measure the bispectrum of lensing fields at high 
significance, as will be shown here. The primary goal of this paper is to determine the expected accuracy on cosmological parameters 
from tomography that jointly uses the lensing power spectrum and bispectrum. The lensing bispectrum or skewness have been applied for 
cosmological parameter estimation in previous studies (Hui 1999; Benabed & Bernardeau 2001; Cooray & Hu 2001a; Refregier et al. 2003). 

In this work, we use the lensing bispectra over all triangle configurations available from a given survey to compute the signal-to-noise 
for the bispectrum measurement. In addition, we use all the cross- and auto-bispectra constructed from the lensing fields in redshift bins. 
To do this, we correctly take into account the covariance in the analysis. We study all the parameters that the lensing power spectrum and 
bispectrum are sensitive to and present results for desired parameters with different marginalization schemes. 

We employ the cold dark matter (CDM) model to predict the lensing power spectrum and bispectrum for dark energy cosmologies. The 
parameter constraints derived thus depend on the model of mass clustering (see Jain & Taylor 2003 and Bernstein & Jain 2003 for a model- 
independent method). Hence, we derive the constraints on dark energy parameters marginalized over the other cosmological parameters on 
which the lensing observables depend. On angular scales below a degree, non-linear evolution leads to a significant enhancement of the 
bispectrum amplitude (compared to perturbation theory), and is likely to amplify the dark energy dependences (e.g., Hui 1999; Ma et al. 
1999). We employ an analytic fitting formula of the mass bispectrum given by Scoccimarro & Couchman (2001; see also Scoccimarro & 
Frieman 1999) to make model predictions of the lensing bispectrum. Since the physics involved in weak lensing is only gravity, developing 
an accurate model of the non-linear bispectrum is achievable from iV-body simulations, as has been done for the non-linear power spectrum 
(e.g., Smith et al. 2003, hereafter Smith03). 

In addition, tomography of the lensing bispectrum offers the possibility of constraining primordial non-Gaussianity. While CMB ob- 
servations so far have shown that the primordial fluctuations are close to Gaussian (e.g., Komatsu et al. 2003), it is still worth exploring 
primordial non-Gaussianity from observations of large-scale structure at low redshifts, since the length scales probed by these two methods 
are different. Exploring primordial non-Gaussianity provides useful information on the physics involved in the early universe, such as particu- 
lar inflation models (e.g., Wang & Kamionkowski 1998; Verde et al. 1999; Komatsu 2002; Dvali et al. 2003; Zaldarriaga 2003 and references 
therein). The lensing bispectrum due to primordial non-Gaussianity has a different redshift evolution from that due to the non-Gaussianity of 
structure formation and, in addition, their configuration dependences differ. Hence, bispectrum tomography can be useful in separating these 
two contributions. We estimate how measuring the lensing bispectrum from future surveys can constrain the primordial non-Gaussianity. 

The structure of this paper is as follows. In §2, we present the formalism for computing the power spectrum, bispectrum and cross- 
spectra of the lensing convergence. The covariance between the bispectra in redshift bins is derived. §3 presents the signal-to-noise ratio for 
measuring the lensing bispectrum for future wide-field surveys. From this estimate, we show how lensing tomography can put constraints on 
primordial non-Gaussian model. In §4, we present the Fisher matrix formalism for lensing tomography. In §5 we present the forecasts for 
constraints on cosmological parameters and the equation of state of dark energy. We conclude in §6. 

We will use the concordance ACDM model with fi cdm = 0.3, fib = 0.05, fide = 0.65, n = 1, h = 0.72 and as = 0.9 as supported 
from the WMAP result (Spergel et al. 2003). fi c dm, fib and fide are density parameters of the cold dark matter, baryons and the cosmological 
constant at present, n is the spectral index of the primordial power spectrum of scalar perturbations, h is the Hubble parameter, and as is the 
rms mass fluctuation in spheres of radius 8/i _1 Mpc. 



2 LENSING POWER SPECTRUM AND BISPECTRUM TOMOGRAPHY 



2.1 Preliminaries: cosmology 

We work in the context of spatially flat cold dark matter models for structure formation. The expansion history of the universe is given by 
the scale factor a(t) in a homogeneous and isotropic universe. The scale factor during the matter dominated epoch is determined by density 
contributions from non-relativistic matter density fi m (the cold dark matter plus the baryons) and dark energy density fide at present, in units 
of the critical density 3Hq/(8itG), where Ho = 100 h km s _1 Mpc -1 is the Hubble parameter at present. The expansion rate, the Hubble 
parameter, is given by 



H 2 (a) 



Hi 



fi m a ' + fid c e 



-3 J"" da'(l+w(a'))/a' 



(1) 



where we have employed the normalization a(to) = 1 today for our convention and w(a) specifies the equation of state for dark energy as 
Pde _ 1 dlnpd, 



w(a) = 



Pde 



3 din a 



- 1. 



(2) 



Note that w = — 1 corresponds to a cosmological constant. The comoving distance x( a ) f rom an observer at a = 1 to a source at a is 
expressed in terms of the Hubble parameter as 
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This gives the distance-redshift relation x{ z ) vla tne relation 1 + z = l/o. 

Dark energy that has negative pressure (p<jc + 3pdc < as observed today) leads to repulsive gravity, and therefore does not cluster 
significantly. However, dark energy does affect the growth of mass clustering through its effect on the expansion rate. In linear theory, all 
Fourier modes of the mass density perturbation, S(= Sp m / p m ), grow at the same rate: Sf, (a) — D(a)Sf, (a — 1), where D(a) is the growth 
factor normalized as D(a = 1) = 1 and, in the following, the tilde symbol is used to denote the Fourier component. The growth factor can 
be computed by solving the linearized differential equation, 5^ + 2{a/a)5j t — AnGpmS^ = 0, where the dot is the derivative with respect 
to physical time. Hence, the growth suppression rate (growth rate relative to that in a flat, matter-dominated universe), g(a) = D(a)/a, can 
be obtained by solving the differential equation (e.g., Wang & Steinhardt 1998): 

+ [5 - 3w(a)O de (a)] + 3 [1 - w(a)] Q dc (a)g(a) = 0, (4) 

a In a z a In a 

where fl dc (a) is the dark energy density parameter at epoch a. We employ the initial conditions of g = 1 and dg/dlna — at dj = 1/1100, 
which is valid for the dark energy models we consider. Throughout this paper we assume a spatially smooth dark energy component (its 
spatial fluctuations matter on length scales comparable with the present-day horizon scale, which are not probed by weak lensing surveys of 
interest). For example, g(a) for dark energy cosmologies are shown in Figure 2 of Linder & Jenkins (2003). 

In this paper we mainly employ the as normalization for the input linear mass power spectrum, equivalent to setting D(a = 1) = 1 
today. In this case, increasing w from w = — 1 or equivalently increasing the dark energy contribution at z > leads to slower redshift 
evolution of the growth rate, since structure formation freezes at higher redshift; for example D w= -o.^(z) > D w= -i(z) for z > 0. Hence, 
the mass power spectrum amplitude is greater for w > — 1 than that for the ACDM model at z > 0. 



2.2 Weak lensing fields 

In the context of cosmological gravitational lensing, the convergence field is expressed as a weighted projection of the three-dimensional 
density fluctuation field between source and observer (e.g., see Bartelmann & Schneider 2001; Mellier 1999 for reviews): 

k{0)= [ XH dxW( X )S[x,X0], (5) 
Jo 

where 8 is the angular position on the sky, x is tne comoving distance, and xh is the distance to the horizon. Note that for a flat universe 
the comoving angular diameter distance is equivalent to the comoving distance. Following Blandford et al. (1991), Miralda-Escude (1991) 
and Kaiser (1992), we have used the Born approximation, where the convergence field is computed along the unperturbed path. The lensing 
weight function W(x) is defined as 

W(x) = \n m0 Hla-\x)x±- f XH d X s p s (z)^^^, (6) 
2 n g J x dxs Xs 

where p s (z) is the redshift selection function of source galaxies and satisfies the normalization condition J^°dz p(z) = n g , with the average 
number density per unit steradian n g . Following Huterer (2002), we employ the source distribution given by 

Ps{z) = n e ^e-*l*\ (7) 

with zo = 0.5. As shown in Figure 1, p(z) peaks at 2zo = 1 and has median redshift of galaxies z mct j = 1.5. 

Future surveys will provide photometric redshift information on source galaxies. This additional information allows us to subdivide the 
galaxies into redshift bins. The average number density of galaxies in a redshift bin i, defined to lie between the comoving distances Xi and 
Xi+i, is given by 

Xi+1 dz 

dXsPs(z)-r—. (8) 

dXs 

Note that this number density determines the shot noise contamination due to the intrinsic ellipticities of galaxies for the power spectrum 
measurement in the bin (see equation (14)). The convergence field for subsample i is 



«(i)(0)= / d X W (l) ( X )S[x,xO], (9) 
Jo 

with the lensing weight function W^) given by 

Wo -y . f Xi+1 , , .dz X s ~X 

u/ l \ ) ~T a Wt dXsPs(z)— , x<Xi+i, nm 

W (i) (x) = { rn Jmax{ x , Xi } d Xs Xs (10) 

0, X>Xi+i- 

where Wo = 3/2 f2 m0 i/o- We have ignored possible errors in the photometric redshifts for simplicity. How a dynamically evolving dark 
energy model changes the lensing weight function is shown in Figure 3 in Huterer (2002). For example, increasing w lowers W^) — opposite 
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Figure 1. The assumed redshift distribution of source galaxies. The curve is in units of the galaxy number density per arcmin 2 and normalized as 



J^°dz p(z) = rig = 100 i 



to the dependence of the growth rate of mass clustering (for cr$ normalization). Thus the dependence of lensing observables on the equation 
of state is somewhat weakened by these two effects. We have checked that the redshift evolution of the lensing weight function provides the 
dominant constraints on the equation of state (see also Figure 2 in Abazajian & Dodelson 2003). 



2.3 The power spectrum and its covariance 

To compute the power spectrum and bispectrum of the convergence, we employ the flat-sky approximation (Blandford et al. 1991; Miralda- 
Escude 1991; Kaiser 1992). Within this framework the lensing convergence field is decomposed into angular modes based on the two- 
dimensional Fourier transform: n(6) — K^e 1 ' ^. The angular power spectrum, C(l), is defined as 

{k h k l2 ) = (2n) 2 S D (l 12 )C(h), (11) 

where S D (l) is the Dirac delta function, (■ ■ •) denotes ensemble averaging, and l r2 = li + l 2 . 

For lensing tomography, we use all the auto- and cross-power spectra that are constructed from source galaxies divided into redshift 
bins. The angular power spectrum between redshift bins i and j, C^j) (I), is given by 



W) (0= / d X W {i) {x)W u) (x)x 





k = -;x 
X 



(12) 



where the lensing weight function Wu) is given by equation (10) and Ps (k) is the three-dimensional mass power spectrum. Using n s redshift 
bins leads to n s (n s + l)/2 cross and auto power spectra. Note that we have used Limber's equation (e.g. Kaiser 1992), which is a good 
approximation over the angular modes we consider, with 50 < / < 3000, corresponding to angular scales between 5' and a few degrees. The 
non-linear gravitational evolution of Ps(k) significantly enhances the amplitude of the lensing power spectrum on angular scales I ^ 100 (see 
Figure 2). Therefore, we need an accurate model of Ps(k), for which we employ the fitting formula proposed by Smith et al. (2003, hereafter 
Smith03). We assume that the Smith03 formula can be applied to dark energy cosmologies, if we replace the growth factor in the formula 
with that for a given dark energy cosmology from which we compute the non-linear scale &nl, the effective spectral index n c ft and the 
spectral curvature C used in the formula to obtain the mapping between the linear and non-linear power spectra. The issue of accurate power 
spectra for general dark energy cosmologies needs to be addressed carefully (see Ma et al. 1999 and Huterer 2002 for related discussions). 
It is encouraging that Linder & Jenkins (2003) find that the mass functions seen in iV-body simulations for dark energy cosmologies are 
well fitted by the universal fitting formula derived in Jenkins et al. (2001), which is derived from the same iV-body simulations as used in 
Smith03. 

Measured shear correlations contain a shot-noise contribution from the intrinsic ellipticities of source galaxies. Assuming that the 
ellipticity distribution is uncorrelated between different galaxies, the observed power spectrum between redshift bins i and j can be expressed 
as (Kaiser 1992, 1998; Hu 1999) 



71 i 



(13) 



where rii is the average number density of galaxies in redshift bin i, as given by equation (8). The Rronecker delta function enforces the fact 
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Figure 2. The lensing power spectra constructed from galaxies in two redshift bins, < 21 < 1.3 and 1.3 < 22. The solid curves are the results for the 
fiducial ACDM model, computed from the Smith03 fitting formula. The boxes show the expected measurement error due to sample variance and intrinsic 
ellipticities. The sky coverage is / s k y =0.1 and the rms of intrinsic ellipticities, <T e = 0.4. The linear power spectrum for the 1 bin is shown by the thin solid 
line to show how significant the non-linear effect is. The dashed curve shows the shot noise contribution to the power spectrum measurement. 

that the cross power spectrum with i 7^ j is not affected by shot noise. In this sense, the cross power spectrum is an unbiased estimator of 
the cosmological signal. We have ignored other possible contaminations such as observational systematics and intrinsic ellipticity alignments 
(the latter is also likely to be negligible for cross power spectra as discussed in Takada & White (2003)). 

The power spectrum covariance is needed to understand statistical errors on the power spectrum measurement. Extending the expression 
in Kaiser (1998) and Scoccimarro et al. (1999), the covariance between the power spectra Cuj) (I) and C( mn ) (/') is 

Cov[C°y) (0 , C°^ S n) (/')] = + 1) AZ/ k t^™} (^(° b ") (0 + C°in) (OC^m) (')] 

+ i^Sm!m T ^- hV ^ (14) 

where Tuj mn \ is the trispectrum of the convergence fields in redshift bins i,j,m and n, / s k y is the fraction of sky covered and Al is the bin 
width centered at I, so that the area of the shell is A(l) = 2-irlAl. The first term denotes the Gaussian contribution to the covariance, and does 
not lead to any correlation between the power spectra of different / modes. The second term is the contribution due to the non-Gaussianity of 
the weak lensing field. We restrict our analysis to angular scales I < 3000. Over this range, the statistical properties of the lensing fields are 
quite close to Gaussian (Scoccimarro et al. 1999; White & Hu 2000; Cooray & Hu 2001b). This is physically because the weak lensing field 
is caused by independent structures at different redshifts along the line of sight and this makes the lensing field close to Gaussian due to the 
central limit theorem, even though the mass distribution in each redshift slice displays strong non-Gaussianity. In what follows, therefore, we 
ignore the non-Gaussian contribution to the power spectrum covariance. 

Figure 2 shows the lensing power spectra for two redshift bins, leading to 3 different power spectra as indicated. The solid curves are 
the results from the Smith03 fitting formula. We parameterized a lensing survey by the sky coverage /sky = 0.1 (« 4000 degree 2 ), the 
galaxy number density n g = 100 arcmin -2 and the rms of intrinsic ellipticities a t — 0.4, respectively. It is clear that the power spectrum 
for higher redshift bin has greater amplitude because of the greater lensing efficiency described by the lensing weight function Wu) (see 
equation (10)). The correlation coefficient between the power spectra of the redshift bins, Rij = Cuj) (0/[C(ii) (O^Tii) (O] 1 ^ 2 ' quantifies 
how the power spectra are correlated. Even with only two redshift bins, the power spectra are highly correlated as R12 ~ 0.8. One thus 
gains little information from further subdivision of the redshift bins (Hu 1999, 2002a,b). The box around each curve shows the expected 
measurement error at a given bin of I, which includes the cosmic sample variance and the error due to the intrinsic ellipticities. As can be 
seen, this type of survey allows the power spectrum measurements with significant signal-to-noise ratio. The dashed curve shows the shot 
noise contribution to the power spectrum measurement. The comparison with the power spectrum amplitude clarifies that shot noise becomes 
dominant over or comparable with sample variance at / £ 10 3 . It is worth noting that shot noise only contributes to the Gaussian term in 
the power spectrum covariance, thus strengthening the case for our Gaussian error assumption (Cooray & Hu 2001b). Finally, to clarify the 
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effect of the non-linear gravitational clustering, the thin solid curve shows the prediction of C(n) (I) from the linear mass power spectrum: 
the non-linear effect induces a significant enhancement in the amplitude by about an order of magnitude at I >, 100. 



2.4 Convergence bispectrum and its covariance 

2.4.1 Definition 

In this subsection, we consider tomography with the lensing bispectrum, which is the main focus of this paper. The bispectrum is the 
lowest-order quantity to extract non-Gaussianity in the weak lensing field, and thus provides additional information on structure formation 
models relative to the power spectrum (e.g., Takada & Jain 2002, 2003a,b,c, hereafter TJ02 and TJ03a,b,c). For convenience of the following 
discussion, we start with the definition of the bispectrum based on the harmonic expansion of the convergence field that accounts for the 
curvature of a celestial sphere (Hu 2000; Komatsu 2002): k(0) = ^2 lm Ki m Yi m (0). The bispectrum, -B(ijfc); 1 z 2 ; 3 , between the convergence 
fields in redshift bins i, j and k, is defined as 

(«Jimi(i)«J2m 2 (j)«J 3 m3(fc)> = ^ ^ ^ ^ ^ B (ijk)hl 2 l 3 , (15) 

where ^ ^ ^ m ^ ' S ^ Signer- 3j symbol. The triangle condition of \U — lj\ < l k < U + lj is imposed for all (mi, m,2, ms), 

and h + h + I3 = even is required from statistical parity invariance of the angular correlation. 

The relation between the bispectra of the all-sky approach and the flat-sky approximation is given by 



R ~ h h h \ , (^1 + l)(2/ 2 + 1) (2Z 3 + 1) 

B (ijk)hi 2 i 3 « I )\ 4^ *>(.ijk){li,h,l3)- (16) 

We will use this equation to compute the all-sky lensing bispectrum from the flat-sky bispectrum (see below), with the Wigner-3j symbol, 
given by the approximation described in Appendix A. 

For a given cosmological model, we use the flat-sky approximation to predict the bispectrum since it is sufficiently accurate over angular 
scales of our interest. Combining the flat-sky approximation and Limber's equation leads to a simple expression for the bispectrum in redshift 
bins i,j and k, B( ijk )(li,l2,l 3 ): 

(/t (i )(ii)/t )(« 2 )/t(fc)(«3)) = (2tt) 2 B {ijk) (h, l2,h)S D '(Z123) (17) 
with 

B (ljk) (h,l 2 ,h) = / XH dxW(i)(x)Wo)(x)W(fc)(x)x" 4 ^(fei,fe 2 ,fe3;x), (18) 
Jo 

where k t = U/x and Bg(ki,k2, fcs) is the bispectrum of the three-dimensional mass fluctuations. Note that the delta function 5° (Z123) 
in the bispectrum (17) enforces the condition that the three vectors li, I2 and £3 form a triangle configuration in the Fourier space. From 
statistical isotropy, the convergence bispectrum is a function of the three parameters that specify the triangle configuration, e.g., l\, I2 and I3. 
For n s redshift bins, n'l lensing bispectra contribute to the Fisher matrix for a given set of (unequal) (li,h, h) 1 . For comparison, there are 
n s (n s + l)/2 power spectra. Note that equations (17) and (18) show there are symmetry relations among the bispectra in an ensemble 
average sense: e.g., for n s = 2 we have the relations B (112 ) = B (12 i) = B(2ii) and B (12 2) = B(2i2) = B^2i) , but they are different in the 
sense that their estimators and covariances are different, as shown below. 

In the following, we consider two sources for the mass bispectrum Bg : one is primordial non-Gaussianity, which could be imprinted 
in the early universe, and the other is the non-Gaussianity induced by non-linear gravitational clustering from primordial (nearly) Gaussian 
fluctuations. 



2.4.2 Bispectrum from primordial non-Gaussianity 

Following Verde et al. (1999), we consider a model in which the primordial gravitational potential is a linear combination of a Gaussian 
random field <j> and a term proportional to the square of the same random field: 

fc(x) = 4>(x) + fsL [^(x) - (cb 2 (x))] (19) 

where /nl parameterizes the non-Gaussian amplitude: in the limit /nl — > 0, the field becomes Gaussian. The physical motivation for this 
model (given that there is a range of possible non-Gaussian models) is that such a non-Gaussian field naturally arises in slow-roll inflation 
and other inflation models (Luo 1994; Falk et al. 1993; Gangui et al. 1994, and also see Komatsu 2001 for a review) or in models in which 
the epoch of reheating has a spatial dependence (Dvali et al. 2003; Zaldarriaga 2003). The Sachs-Wolfe effect measured in the CMB power 
spectrum implies $ ~ 10~° and the potential remains constant in time on horizon scales. From this amplitude and the current limit on 



1 The number of bispectra is reduced to n s (n s + l)/2 and n a (n a + l)(n s + 2)/6 for isosceles triangles, e.g. ii = I2 ^ h, and equilateral triangles with 
il = h = I3, respectively. 
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/nl < 100 from the CMB bispectrum measurement (Komatsu et al. 2003), the second term in equation (19) is much smaller than the first 
term. As long as $ ^ 1 at a given length scale, the redshift evolution of $ obeys linear theory. 

A future wide-field lensing survey will enable accurate measurements of the lensing bispectrum down to arcminute scales, as shown 
below. On scales larger than a degree, the lensing convergence arises from structures in the linear or weakly nonlinear regime, so that 
different Fourier modes are almost uncorrelated. Hence the lensing field retains the imprint of primordial non-Gaussianity. On smaller scales, 
it remains to be seen how highly non-linear structures show signatures from primordial non-Gaussianity (see Matarrese et al. 2000 for an 
analytic discussion). 

To consider the effect of primordial non-Gaussianity on weak lensing fields, we need to account for the different evolution of modes 
inside the horizon. We assume that, in the linear regime, the evolution can be described by the transfer function. Since the density fluctuation 
field is related to the potential field via the Poissonian equation, the Fourier transform of the density fluctuation field, S(k), at epoch z can be 
expressed as 

S(k;z) =D(z)M k $(k), (20) 
with 

where D(z) is the linear growth rate and T(k) is the transfer function as given e.g. by the BBKS formula (Bardeen et al. 1986). We assume 
that the power spectrum of <S is given by a single power law, P$(fc) oc k n ~ 4 , and the amplitude is typically fc 3 P<i> ~ 10~ 5 as stated 
above. Hence, the linear mass power spectrum at redshift z is given by P/'(fc; z) — D(z) 2 MlPq>(k). For a plausible range of /nl, the 
non-Gaussian contribution to the mass power spectrum is much smaller than the Gaussian contribution as stated above, and therefore it is a 
good approximation to fix the normalization of Pt{k) from the Gaussian term alone, based on measurements such as the CMB anisotropy 
or the a s normalization. 

Even in the linear regime, the bispectrum of mass fluctuations due to primordial non-Gaussianity at redshift z is given by 



B^ G (k 1 ,k 2 ,k 3 ;z) = 2f NL D 3 (z) 



Pt(k<)Pt(M) J Zi +2 P erm. 
Mia Mk 2 



(22) 



Substituting this expression into equation (18) gives the prediction for the lensing bispectrum. We will use the prediction to estimate how a 
wide-field lensing survey can constrain /nl- 



2.4.3 Bispectrum due to gravitational clustering 

Even if the primordial fluctuations are Gaussian, non-linear gravitational clustering at low redshifts leads to non-Gaussian features in the 
mass distribution due to the coupling of different Fourier modes. 

In the weakly non-linear regime (S < 1), the evolution of mass clustering can be described by perturbation theory (e.g., Bernardeau et 
al. 2002). Perturbation theory ceases to be accurate in the non-linear regime (S ^ 1). Non-linear clustering enhances the amplitude of mass 
fluctuations which significantly increases the lensing signal on angular scales below a degree, as shown in Figure 2 (see also Jain & Seljak 
1997). Therefore, it is necessary to account for non-linear effects to correctly interpret measurements from lensing surveys. However, an 
accurate model of the non-linear mass bispectrum is not yet available, in part because simulations needed to calibrate such model predictions 
are challenging to carry out: many realizations of high resolution simulations are needed to calibrate the triangle configuration dependence 
of the bispectrum. For this study we will use the best available models to clarify the usefulness of bispectrum tomography. While the 
resulting parameter accuracy forecasts are likely to be adequate, to actually infer the correct values of parameters from measurements, better 
predictions will be needed. 

There are two well studied analytic models of nonlinear clustering that we have implemented: hyper-extended perturbation theory 
(Scoccimarro & Frieman 1999; Scoccimarro & Couchman 2001) and the dark matter halo approach (e.g., Cooray & Hu 2001a, TJ03b,c, 
Takada & Hamana 2003 and Cooray & Sheth 2002 for a review). These models are accurate at the 10 — 30% level in the amplitude of the 
three-point correlation function (TJ03b,c). In this paper, we employ the fitting formula of Scoccimarro & Couchman for the mass bispectrum, 
since it makes it feasible to evaluate the Fisher matrix of bispectrum tomography with reasonable computational expense compared to the 
halo model implementation: 

B? RAV (fci,fc 2 ,fc 3 ;^) = 2Ff(k 1 ,k 2 )P s NL (k 1 ;z)P s NL (k 2 ;z) + 2 perm., (23) 
with the kernel F 2 a given by 

F 2 eff (fei,fc 2 ) = ^a(n eS ,k 1 )a(n e s,k 2 ) + i + j^) ( fcl fc2 ) b(n eS , fci)6(n e ff, fc 2 ) + i ^ToTT^ c(n off , fci)c(n off , fc 2 ). (24) 
7 2 \K2 fci/ kik'2 7 fcjfc^ 

Note that Pg ,L in equation (23) is the non-linear mass power spectrum for which we use the Smith03 fitting formula. The fitting functions in 
F 2 a are given by 



, i\ 1 + *s 02 (*) y/0-7Qs Kff ) (g/4)"°« +3 - 5 

fl(nCff ' fe) = 1 + ( g /4)"ett + 3.5 ' 

w 1 + 0.4(n eff + 3)q n ^+ 3 

= l +g n cff + 3.5 > 
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Figure 3. Left panel: The convergence bispectra with two redshift bins for equilateral triangles, plotted against triangle side length. For two redshift bins, 
there are 4 different bispectra, as indicated. For comparison the thin solid curve shows the perturbation theory prediction for B(m). Non-linear clustering 
significantly enhances the bispectrum amplitude for I Zi 100. Right panel: Similar plot, but for the bispectra due to primordial non-Gaussianity, which are 
computed from equation (22). The non-Gaussian parameter /nl is taken to be 1. 



C(Wcff ' fc) = l + (2 g )"^+3.5 • 

where n e s is the effective spectral index of the power spectrum at the scale k defined as n e ff(fc) = din P$ /d In k (P^(k) is the linear mass 
power spectrum). The quantities q and Q 3 are defined by q = fc/fc N L with (k 3 /2TY 3 )D 2 (z)P^(k- N i J ) = 1 and Q 3 {n aS ) = (4- 2™ off )/(l + 
2 n ° ff+1 ), respectively, and ag(z) = D(z)a$. 

One can find that for large length scales, where k <C &nl ( or equivalently g< 1), a — b — c — 1 and the bispectrum (23) recovers 
the tree level perturbation theory prediction. On the other hand, on small scales, k 3> fci-jL, a, = cr^ ' 2 (z) y/0.7Q-t(n c f{ ) and b = c = 0, 
so the bispectrum becomes independent of triangle configuration and thus provides the so-called hierarchical ansatz. However, it should be 
noted that a more realistic non-linear clustering likely displays weak violation of the hierarchical ansatz in the strongly non-linear regimes 
(TJ03b,c). 

It is worth mentioning about how the bispectrum (23) depends on equation of state of dark energy. In the weakly non-linear regime, the 
dark energy dependence enters into the growth factor only and it does not modify the configuration dependence (as long as we ignore the 
spatial fluctuations that modify the transfer function on horizon scales, as shown in Caldwell et al. 1998, Ma et al. 1999). In the non-linear 
regime, the dependence is captured in a complex way through the non-linear power spectrum and as(z) and the non-linear scale /cnl(«) in 
the fitting functions a, b and c. However, the dark energy dependence on the tensing bispectrum, obtained after projecting the mass bispectrum 
with lensing efficiency, is mainly determined by the dependence of the lensing weight function (6). 

A comparison of equations (22) and (23) shows the differences between the bispectra due to the primordial non-Gaussianity and grav- 
itational clustering. The first difference is in the redshift dependence: the bispectrum from primordial non-Gaussianity has a slower redshift 
evolution than the gravitational clustering bispectrum. Thus lensing tomography should be effective in separating these two contributions. 
The bispectra also differ in the dependence on triangle configuration: the gravity bispectrum has a stronger configuration dependences on 
large length scales where the primordial non-Gaussianity is relevant. 

Figure 3 shows the convergence bispectra with two redshift bins against side length I of equilateral triangles, as in Figure 2. The left 
panel shows the model predictions from the hyper extended perturbation theory (23), while the right panel shows the predictions from the 
primordial non-Gaussian model (22) with /nl = 1- For two redshift bins, we have 4 different lensing bispectra as indicated. For comparison, 
the thin curve in the left panel shows the lensing bispectrum computed from the perturbation theory of structure formation. It is obvious that 
the non-linear gravitational clustering significantly enhances the bispectrum amplitude relative to the perturbation theory prediction by more 
than an order of magnitude at I £ 100. Note that the halo model used in TJ03c provides the similar bispectra to within 25% difference in the 
amplitude over the range of angular scales we consider. From the right panel, one can find that the amplitude of the bispectrum B^ G from the 
primordial non-Gaussian model is much smaller than that from the structure formation bispectrum B J RAV over the range of angular scales 
we consider (this is true even if we use the current upper limit /nl — 100 in Komatsu et al. 2003). 



2.4.4 Bispectrum covariance 

In this subsection, we derive the covariance of the convergence bispectrum. To do this, we start with considering an estimator of the bispec- 
trum. One practical advantage for measuring the bispectrum (and more generally odd numbered correlation functions) is that the intrinsic 
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Figure 4. The minimum value of /nl in the primordial non-Gaussian model such that the lensing bispectrum is detectable with S/N > 1, for a survey with 
/sky = 0.1. It is plotted against the maximum multipole moment l max . Here we ignore the contribution from nonlinear gravitational clustering. Note that 
— 1/2 

/nl scales as / ' . The different curves show the results for varying numbers of redshift bins used. 



ellipticities do not contaminate the measurement, as long as the intrinsic ellipticity distribution is symmetric with zero mean. Thus the 
measured bispectrum is an unbiased estimator of the cosmological signal: 



B°ijk)(li,l2,l3) ~ B(ijk)(h,h,ls)- 



(26) 



It is also worth noting that the B-mode induced by intrinsic ellipticity alignment does not contaminate the bispectrum measurement due to 
statistical parity invariance (Schneider 2003). 

Extending the expression of the bispectrum covariance derived within the full-sky approach (Hu 2000), the covariance between the 
lensing bispectra (15) in redshift bins can be simply expressed as 



Cov 



(27) 



For finite sky coverage, the bispectrum covariance scales as / s w. We have ignored the non-Gaussian contributions to the covariance that 
arise from the connected 3-, 4- and 6-point functions of the convergence field - see discussion for the power spectrum following equation 
(14). This is in part verified by examining the ratio of the non-Gaussian to the Gaussian contribution to the covariance. For example, this ratio 
for one of these non-Gaussian terms is given by B 2 /Cf, and Figures 2 and 3 show that this is much smaller than unity over angular scales 
of interest. The Kronecker delta functions such as Sup Si p 5/ y in the equation above guarantee that the bispectra of different triangles are 
uncorrelated, which makes the Fisher matrix analysis significantly simplified. Note that, on the other hand, the bispectra in different redshift 
bins are highly correlated. 

In the following Fisher matrix analysis, we impose the condition l\ < h < I3 so that every triangle configuration is counted once. This 
condition leads to the following expression for the covariance: 



Cov 



A 



(llj2j3)f s ^G°J^(ll)G° bj ^l2)C° c k)(l3)Sl 1 l' i Si 2 i 2 5l. 



(28) 



where A(Zi, h,h) — 1 if all I's are different, A(li, I2, I3) — 2 if two I's are repeated and A(Zi, h, I3) = 6 if h — I2 = Z3, respectively. 
The observed power spectrum is given by equation (13). Strictly speaking, although equation (28) is correct only for !i / I2 / I3 when 
we consider redshift bins, we checked that it is a good approximation for the Fisher matrix analysis that follows, because triangles with 
'1 7^ h 7^ I3 provide the dominant contribution over angular scales of interest. Finally, equation (28) shows that for two redshift bins we 
have the equality B(ii2) = S( 12 i) = B(2ii) (similar conditions hold for other indices) as stated below equation (18), but their covariances 
are indeed different for a general set of (1%, I2, Z3). 
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Figure 5. Signal-to-noise ratio estimate for measuring the lensing bispectrum against the maximum multipole moment Z max , as in the previous figure. We 
consider the bispectrum due to structure formation and ignore the contribution from primordial non-Gaussianity. For comparison, the thin dashed curve shows 
the signal-to-noise for the power spectrum for 2 redshift bins. It is clear that 2m ax triangles allow significant S/N for the bispectrum measurement comparable 
with that for power spectrum at i max ?L 1000. 



3 SIGNAL-TO-NOISE FOR THE BISPECTRUM 

Before going to parameter accuracy forecasts, we examine how feasible it is to simply measure a non-zero convergence bispectrum from a 
wide-field lensing survey. Given the bispectrum covariance (28), the cumulative signal-to-noise (S/N) ratio for measuring the bispectra in 
redshift bins can be expressed as 

/ S \ 2 _ ! 

= ^ B hi2h(m) [Cov[B ili2i3(ijfc) ,B ili2i3([!i , c) ]] B hhh ( abc) , (29) 

l m i„<h<h<h< l mB. x (i,j,k),(a,b,c) 

where [Cov] -1 denotes the inverse of the covariance matrix, and i m i n and Z max denote the minimum and maximum multipole moments 
considered. The condition l\ < I2 < £3 is imposed, as stated above. Each of the labels (a, b, c) and (i, j, k) denote redshift bins running over 
1, • ■ • , n 3 , thus providing contributions from nf bispectra to the S/N. Given a survey that probes I up to i max , a rough estimate of the S/N 
is 

f $ \ r 76 ( ^m aX Imax ^m a x \ ^(^m a x) „ ,2 ^.2 ri2 \i 

^ J ~ Jskyt ma x I q q q I q,^ -j3 ~ JskytmiixVKL'miix^V'miixJJ 

in 5 ( f Bk y\ flmax\ ( Qk\ 2 Mm a xC(/ max ) \ 

~ 10 I oTJ llo^J I Too ) { w-* ) ' (30) 

where we have used equation (16) and ^ (3<i ~ 'm ax . ( q q q ^ ~ ^ ^ ^ 01 ' ^ ^ mtr °duced the reduced bispectrum 

Q K defined as Q K ~ B(l) /[C(l)] 2 , often used in the literature (e.g., TJ03c). Note that in the estimate above we ignored the shot noise 
contribution of intrinsic ellipticities to the covariance, and thus the estimate overestimates the S/N (see Figure 5). As implied from Figures 
2 and 3 (also see Figure 6 in TJ03c), a plausible model for the convergence bispectrum leads to 60 ,$ Q K i; 150 for I £ 100 (corresponding 
to 9 £ 1 degree) for the ACDM model. This rough estimate shows that a future survey with / SKy =0.1 would allow for a detection with 
S/N - 300 for I 

max — 10 3 ,if we combine the information from all triangle configurations available. 

We give below a more quantitative S/N estimate for measuring the convergence bispectrum. As described in §2.4.2 and 2.4.3, there are 
two cosmological contributions to the lensing bispectrum, due to the primordial non-Gaussianity and gravitational clustering. The observed 
bispectrum can be expressed as B K = B^ G + RAV . 

In Figure 4 we examine the smallest /nl against the maximum multipole moment l m&K , defined so that the lensing bispectrum is 
detectable with S/N > 1. We compute this /nl by setting B K = B^ G (that is, we ignore the contamination from the bispectrum induced 
by gravitational clustering B GKAV ). The sky coverage is taken to be /nl = 0.1 and the minimum multipole moment Z m i n = 50 so that 
^min ~ 10 x 27r/ yj 47r/ s k y . The figure shows that the tomography of the lensing bispectrum can lower the /nl value, reflecting the fact 
that bispectra of the tomography provide additional information on the S/N. As can be seen, /nl is mainly determined by modes with 
I ^ 100, due to the high-fc suppression of the kernel M k in the 3D mass bispectrum (see equation (21) and Figure 1 in Verde et al. 1999). 
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In addition, at higher I the structure formation bispectrum is more dominant (see Figure 3). Therefore, focusing on large angular scales £ 1 
degree is probably an appropriate strategy for constraining primordial non-Gaussianity. 

— 1/2 

The strongest constraint is /nl ~ 150/ sky for i max = 500 and four redshift bins. This result is comparable to the forecast from the 
galaxy bispectrum measurement from the SDSS/2dF surveys in Verde et al. (1999), where the range /nl = 10 3 — 10 4 was derived (for 
/sky = 10~ 5 — 0.25 for galaxy catalogs). Unless an almost all-sky lensing survey is available, the CMB bisepctrum is likely to provide more 
stringent constraint on /nl, as shown by the WMAP result, —59 < /nl < 134 at 95% confidence, in Komatsu et al. (2003). Hence, we will 
ignore the primordial non-Gaussian contribution in the following analysis. Only if a significantly deeper all sky lensing survey is feasible, 
would it be worth returning to this question and exploring measurement strategy. 

Figure 5 shows the signal-to-noise for measuring the lensing bispectrum due to structure formation, B^ RAV . Note that for / max > 300 
we used a binning of / in the S/N evaluation as described below (see equation (35)), since the direct summation over I's is computationally 
time-consuming for large l m „. It is obvious that a survey with / s k y = 0.1 allows a significant detection of the lensing bispectrum with 
S/N >, 10 for £ max > 300. Bispectrum tomography leads to only slight improvement in the S/N value. For comparison, the top thin 
curve shows the corresponding S/N for measuring the power spectrum for 2 redshift bins, which is roughly estimated as (S/N)ps ~ 
./sky ^max ~ 316(/ sky /0.1)- 1/2 (Z ma x/1000) (the shot noise is ignored). It is apparent that the S/N values for power spectrum and 
bispectrum become comparable at Z max ~ 1000. Further, the bispectrum S/N displays stronger dependence on / max (roughly S/N oc Z^ ax 
around Z max ~ 1000) than seen in power spectrum. Because the bispectrum measurement is contributed from triangles, while the power 
spectrum from different i max modes. 



4 FISHER MATRIX ANALYSIS FOR LENSING TOMOGRAPHY 

We will use the Fisher matrix formalism (see Tegmark et al. 1997 for a review), to examine how tomography with the lensing power spectrum 
and bispectrum can constrain cosmological parameters and the equation of state of dark energy. 



4.1 Methodology 

Given a data vector x, the Fisher information matrix describes how the errors propagate into the precision on parameters p a . The Fisher 
matrix is given by 

where L is the likelihood function of the data set x given the true parameters pi, • • • ,p a . The partial derivative with respect to a parameter 
p a is evaluated around the fiducial model. The Fisher matrix quantifies the best statistical errors achievable on parameter determination with 
a given data set: The variance of an unbiased estimator of a parameter p a obeys the inequality: 

(Apt) > (F- 1 )^, (32) 

where (F^ 1 ) denotes the inverse of the Fisher matrix and Ap a is the relative error on parameter p a around its fiducial value. Note that this 
condition includes marginalization over the other parameters pp (a =fc f3). 

First, we consider power spectrum tomography. Assuming the likelihood function for the lensing power spectrum to be Gaussian, the 
Fisher matrix can be expressed as 

*3=E E ^^[Cov^W,^^- 1 ^^, (33) 

i=i min (i,j),(m,n) 

where the power spectrum covariance is given by equation (14) and [Cov] -1 denotes the inverse matrix. In the Fisher matrix evaluation 
above, we consider n s (n 3 + l)/2 cross- and auto-power spectra taken from n s redshift bins. We will use the power spectrum in 100 bins 
in I rather than at every multipole moment. We checked that the binning makes no large difference on our results, since the lensing power 
spectrum does not have any complex features. 

Similarly, we assume that the likelihood function for the lensing bispectrum is close to Gaussian. The Fisher matrix for bispectrum 
tomography is then given by 

pbisp _ dB hhl3(ijk) r r i-i dB hh i 3(abc) 

t a .& ~ 2^ 2^ [fZ [^[ B iii2h(ijk),o lll2h{abc) \\ — , (34) 

imi„<il<i2<! 3 <!max (», j,fc), (o,6,c) 

where the bispectrum covariance is given by equation (27) and we have imposed the condition h < h < h so that every triangle configura- 
tion is counted once. There are ~ 10 9 triangles for / max = 3000, hence it is computationally intractable to compute the contributions from 
every triangle. We bin li and h, and use the approximation: 

C P = EE A ^E E m t 3(ijk) N^^^-^'.swf 1 dBlll d T bc) ' (35) 

l' l' h (i,3,k),(a,b,c) 
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where l[ , 1' 2 denote the binned values of h and I2 (we will use 100 bins in the following results). Note that we continue to use a direct 
summation over Z3 so that we can correctly account for the condition that the Wigner 3-j symbol is non-vanishing for h + h + I3 = even 
and vanishes for h + h + h = odd, where h and fo are now the central values in the bin. We checked that our results are not sensitive to 
this binning, because the lensing bispectrum is smooth within the bin widths used. 

We next consider the Fisher information matrix for a joint measurement of the lensing power spectrum and bispectrum. To do this 
requires a knowledge of the cross covariance between measurements of the power spectrum and bispectrum, which quantifies how the two 
observables are correlated. However, the covariance has no Gaussian contribution expressed in terms of products of the power spectrum, and 
arises from the 5-point function of the lensing fields. Hence, we ignore the cross covariance for the consistency of the procedure we have 
taken: the Fisher matrix for the joint measurement can then be approximated by 



F af , 



pps I pbisp 



(36) 



It is useful to consider constraint ellipses in a two-parameter subset of the full parameter space. These are obtained by projection of the 
higher dimensional ellipsoids, since the projected ellipses include marginalization over the other parameter uncertainties. Assuming that the 
form of the parameter likelihood function is approximated by a multivariate Gaussian function, the projected likelihood function for the two 
parameters p a and pp is given by (e.g., Matsubara & Szalay 2002): 



£{PaiPp) oc exp • 



-i {Ap a Apg) 




(37) 



where the normalization factor is determined to satisfy J Ap a AppC — 1 and [• ■ denotes the inverse matrix. When many parameters are 
considered (we will use up to 8 parameters), the Fisher matrix formalism has considerable computational advantages over a grid-based search 
in the likelihood space. Finally, the correlation coefficient, which quantifies how the constraints on parameters p a and pg are degenerate with 
each other, is defined as 



\J (F 1 ) aa {F 1 )/3 I 3 

If |r I — 1, the parameters constraints are totally degenerate, while r = means they are uncorrelated. 



(38) 



4.2 The parameter space and fiducial model 

We restrict our analysis to a flat CDM universe, supported by CMB observations (e.g, Spergel et al. 2003). The parameter forecasts derived 
are sensitive to the parameter space used and to whether constraint on a given parameter are obtained by marginalizing over other parameter 
uncertainties. We mainly use seven parameters which determine the lensing observables within the CDM model: tide, wo, w a , as, n, fib/1 2 
and h, where fide and fib are the density parameters of dark energy and baryons, n is the spectral index of primordial scalar perturbations, h 
is the Hubble parameter, and as is the rms mass fluctuation in a sphere of radius 8/i _1 Mpc. We use a simple parameterization of the equation 
of state of dark energy: w(a) — uio + - a) (Turner & White 1997; Linder 2003). Note that we use fi D /i rather than fib, because 
fib ft 2 is directly probed by the CMB observations and we will use its prior constraint in the Fisher analysis. For the input linear mass power 
spectrum, we employ the BBKS transfer function (Bardeen et al. 1986) with the shape parameter given by Sugiyama (1995), which accounts 
for the baryon contribution. We thus ignore the effect of dark energy spatial fluctuations on the transfer function. For slowly varying dark 
energy models with w < —0.6 (the rough current limit from observations, Perlmutter, White & Turner 1999), the BBKS approximation 
is reliable on scales probed by weak lensing surveys. Note that dark energy clustering affects horizon scales, and it therefore modifies the 
COBE normalization through changes to the Sachs-Wolfe effect and the integrated Sachs-Wolfe effect (Caldwell et al. 1998; Ma et al. 1999; 
Hu 2002a). For precise comparisons with data, we will need more accurate transfer function as given by CMBFAST (Seljak & Zaldarriaga 
1996). Finally, in this paper we ignore the possible effect of massive neutrinos on the mass power spectrum (see e.g. Hu 2002a; Abazajian & 
Dodelson 2003). 

The Fisher matrix formalism assesses how well lensing observables can distinguish the true ("fiducial") model of the universe from 
other models. The results we obtain depend upon the fiducial model. We use the currently concordant flat ACDM model with fi de = 0.65, 
too = — 1, w a = 0, as = 0.9, n — 1, fib = 0.05 and h = 0.72, which is consistent with the recent WMAP results (Spergel et al. 2003). 
To compute the Fisher matrix, we need to choose steps in parameter directions to compute the numerical derivatives. We choose the steps 
to be 5% in the parameter values, except for Aw a — ±0.1; two-side numerical derivatives are used. We will pay special attention to how 
parameters fide, too, w a and as can be constrained from lensing tomography. On the other hand, we will employ prior information on n 
fib/1 2 and h, expected from measurements of the CMB temperature and polarization by the Planck satellite mission (e.g., Eisenstein et al. 
1999). Assuming Gaussian priors, we add the diagonal component (.Fprior)^ = Sai3a(p a )~ 2 to the Fisher matrix. Note that we do not 
employ any priors to the parameters fide, wo, w a and as and thus the following results are conservative (see Figure 8 for the joint parameter 
forecasts using lensing tomography and the CMB). 



5 RESULTS: COSMOLOGICAL CONSTRAINTS 

We consider three scenarios for constraining cosmological parameters: using the lensing power spectrum and bispectrum without any tomog- 
raphy, tomography in two redshift bins, and in three redshift bins. Figure 6 shows the resulting constraint ellipses in the parameter space of 
fide, too, TO a and as, corresponding to the 68% confidence level (A\ 2 = 2.3), marginalized over the parameters fib, n and h. We consider 
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Figure 6. Projected 68% CL constraints in the parameter space of f2,j e , wo, w a and as from the lensing power spectrum and the bispectrum in two redshift 
bins, as indicated. We employ 7 cosmological parameters in the Fisher matrix analysis. The results shown are obtained assuming priors on n, Cl^h 2 and 



h expected from the Planck mission. The sky coverage and number density are taken to be f ( 



ky 



0.1 and n„ 



100 arcmin , and angular modes 



50 < I < 3000 are used. It is clear that bispectrum tomography can improve parameter constraints significantly, typically by a factor of three, compared to 
just power spectrum tomography. 



angular scales 50 < I < 3000 and assume the priors a{lnn h h 2 ) = 0.010, a(n) = 0.008 and a(h) = 0.13, expected from the Planck 
mission (see Table 2 in Eisenstein et al. 1999). Each redshift bin is chosen so as to have equal number density of galaxies; < z < 1.3 
and 1.3 < z for the galaxy distribution shown in Figure 1. One can see that bispectrum tomography provides additional information on 
the parameters relative to power spectrum tomography. In particular, one striking result is that the parameter constraints from bispectrum 
tomography alone are comparable with those from power spectrum tomography. As a result, combining the two can significantly improve 
parameter determination by a factor of 3, because of their complementarity. For example, the equation of state parameters are constrained 
as a(w) = 0.034/ s ~^/ 2 and a(w a ) = 0.11/ sk y / ' 2 . There are two essential features needed to obtain these results. First, we considered all 
triangle configurations (~ 10 9 triangles) available from a range of angular scales we consider, which leads to a significant gain in S/N. 
Second, we considered all the cross- and auto-spectra constructed from the redshift bins, providing 3 power spectrum and 8 bispectrum 
contributions to the Fisher matrix. 

The correlation coefficients of the Fisher matrix elements quantify how the parameter constraints are degenerate. From the definition 
(38), we find the dark energy parameters are highly correlated as r(Qdc, wo) = —0.98, r(Qdc, w a ) = 0.95 and r(wo, w a ) — —0.97. It is 
also worth noting the importance of the marginalization in the Fisher matrix formalism. If we fix 0,^, n and h to their fiducial values, the 
constraints improve significantly: a(Qa e ) = 0.0031, a(wo) = 0.038, <j(w a ) ~ 0.15 and a(as) = 0.0035 compared to the errors shown in 
Table 1. 

Table 1 gives the 1-a errors on parameters for three cases: no tomography, tomography with 2 redshift bins, and with 3 redshift 
bins. Clearly tomography leads to significant improvements in parameter accuracies. For the equation of state parameters too and w a , the 
improvement in the 1-a errors is a factor of 10. As stressed in Hu (1999), on the other hand, fine divisions of the galaxy redshift distribution 
for lensing tomography does not give much additional information. This can be seen from Figure 7, which shows the results for three redshift 
bins, < z < 1, 1 < z < 1.7 and z > 1.7: In this case, we considered 27 bispectra and 6 power spectra in the Fisher matrix analysis. The 
constraints on the parameters are improved by 15-20%. 

As shown in Figures 6 and 7, lensing tomography alone leads to degenerate constraints on Side and the equation of state parameters 
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Figure 7. As in the previous figure, but for 3 redshift bins. In this case, we use 6 power spectra and 27 bispectra in the analysis. 
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0.91 


0.36 


0.65 


0.71 


0.31 (0.14) 



Table 1. Summary of parameter constraints from lensing tomography of the power spectrum (PS), the bispectrum (Bisp) and both measurements (PS+Bisp). 

— 1/2 

All errors are 1-a level and include marginalization over the other parameters. Note that we have used f s ^ y =0.1 and all the la errors scale as oc / sky ■ 
The values in the brackets for three redshift bins show the constraints when we combine lensing tomography and CMB observations (see Figure 8 in more 
detail). 

Wo and w a - Therefore, external information on these parameters from other methods such as observations of CMB, supernovae and galaxy 
redshift surveys is valuable in that it allows us to break the degeneracies. This is investigated in Figure 8, which shows how the constraint 
ellipses in the subspace of (fide, wo, w a ) change when we combine the constraints from lensing tomography and CMB. To compute the Fisher 
matrix from measurements of the CMB temperature and polarization power spectra and the cross spectrum, we assumed the experimental 
specifications of the Planck 143 and 217 GHz channels with 65% sky coverage and assumed a spatially flat universe, no massive neutrinos, 
no running tilt and no gravity waves. However, we include the dependence of the dark energy equation of state, which modifies the CMB 
power spectra through the change of the angular diameter distance to the last scattering surface (we ignored its spatial fluctuations). 

In this case, the CMB power spectra are specified by 8 parameters (Qdc, u>o, w a , 8q, n, Qhh 2 , h, r) and have been computed using 
CMBFAST version 4.5 (Seljak & Zaldarriaga 1996). 8q is the amplitude of the primordial curvature fluctuation and r denotes the optical 
depth (their fiducial values are taken to be 5^ = 4.56 x 10 and r = 0.15). Likewise, to compute the Fisher matrix from lensing 
tomography, we employed the normalization of the mass power spectrum instead of the erg normalization for the consistency. Figure 8 
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Figure 8. Error ellipses (68% CL) for dark energy parameters (f^g, wq, w a ) when we use lensing tomography (PS+Bisp) with three redshift bins, CMB 
power spectra from the Planck experiment and combine the two. The constraints include marginalization over the other parameters, including the optical depth 
(see the text in more detail). 
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Figure 9. Parameter accuracy forecasts for the proposed SNAP survey of Type la supernovae and weak lensing, and the joint constraints from the two 
measuremetns. Note that the sky coverage f^y = 0.01 is assumed for the lensing survey and we used the prior cr(Q^ e ) = 0.03 for the supernova constraint. 



shows that combining the constraints from CMB measurements and lensing tomography significantly improves the dark energy constraints, 
yielding cr(fide) = 4.6 x 10 -3 , rr(wo) = 0.046 and a(w a ) = 0.14. Note that the results shown are derived by adding the two (7 x 7) 
Fisher matrices from lensing tomography and the CMB spectra, including the marginalization over the optical depth. It is also worth noting 
that the CMB measurements can better constrain the other parameters (S( , Jib, n, h) than lensing tomography, which is part of the reason for 
the improvement in Figure 8. 

Figure 9 shows forecasts of the dark energy constraints expected from the SNAP survey of Type la supernova measurement (E. Linder, 
private communication) and its lensing survey. We assumed the sky coverage / s k y = 0.01 for the SNAP weak lensing survey (see Massey et 
al. 2003; Refregier et al. 2003). It is again apparent that a joint analysis of weak lensing and supernovae significantly improves the parameter 
errors due to their complementarity: the errors become <r(f2d c ) = 6.7 x 10~ 3 , cr(wo) = 0.061 and a(w a ) = 0.32. Moreover, this result can 
be further improved by combining with the CMB constraints, which gives (for / s k y = 0.01): cr(u>o) = 0.042 and a(w a ) = 0.15. 



5.1 Constraints on running spectral index 

Next we examine how lensing observations can probe the running spectral index of primordial scalar perturbations, motivated from the 
WMAP result (Spergel et al. 2003). CMB measurements probe large length scale fluctuations and the constraint on the running spectral 
index is to some extent diluted by the other parameter uncertainties (especially the optical depth r). In fact, strong constraints on the running 
of the spectral index can be obtained only by combining CMB, galaxy surveys and the Ly-a forest. Weak lensing directly probes the relevant 
small length scales, and is not susceptible to uncertainties due to biasing. The measured spectra however have undergone nonlinear evolution, 
so an accurate model is needed to infer the shape of the linear spectrum. 

To make forecasts for future lensing surveys, we follow the model in Spergel et al. (2003), where the linear mass power spectrum is 
expressed as 
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Figure 10. The constraint ellipses in the parameter space of iDn, w a and the running spectral index parameter dn/dln k. The Fisher analysis is as for Figure 
6. In total, 8 parameters are considered to obtain the marginalized ellipses shown (dn /dink, plus the seven parameters considered in the preceding analysis). 



Ps(k) oc 



(fco) 



n + a s ln(fc/fc )/2 



T 2 (fc), 



(39) 



where a 3 = dn/d\nk and fco = 0.05 Mpc -1 and T(k) is the transfer function. Figure 10 shows the forecast constraints on wo, w a and 
dn/d\nk, for the two redshift bins. Here we considered 8 parameters in the Fisher matrix analysis (dn/dln k, plus the seven parameters 
we have used so far), which enlarges the constraint ellipses in Figure 6. It is clear that lensing tomography can put stringent constraints 
on the running spectral index: a(dn/d In fc) = 0.0037/ s ^y //2 . This result can be compared with the constraint a(dn/d In fc) = 0.004 
expected from the Planck mission (see Table 2 in Eisenstein et al. 1999) and with the constraints expected from galaxy redshift surveys 
(a(dn/d\nk) — 0.42) and the Ly-a forest (a(dn/d\nk) = 0.04 in Mandelbaum et al. 2003) 2 . For this parameter the bispectrum is 
especially powerful, as evident from Figure 10. 



6 CONCLUSION AND DISCUSSION 

We have used lensing tomography with the power spectrum and bispectrum as a probe of dark energy evolution and the mass power spectrum. 
The lensing bispectrum has different dependences on the lensing weight function and the growth rate of mass clustering from those of 
the power spectrum. Thus bispectrum tomography provides complementary constraints on cosmological parameters to power spectrum 
tomography. By using information from different triangle configurations and all cross-spectra in redshift bins we find that the bispectrum 
has roughly as much information as the power spectrum on parameters of interest (see Figures 6 and 7). Parameter accuracies are typically 
improved by a factor of 3 if both the power spectrum and bispectrum are used, compared to the standard approach of using just the power 
spectrum (see Table 1). Thus our study provides strong motivation for the use of the bispectrum from lensing surveys for parameter extraction. 

Since lensing observables are significantly affected by non-linear gravitational clustering on angular scales below a degree, we used a 
non-linear model (see discussion below) to compute the lensing bispectrum and to estimate the precision on the parameters erg, Side, wo, 
w a and dn/dln fc. The constraints on the dark energy parameters are a(wo) ~ 0.03/ s ^y and a(w a ) ~ 0-l/ s ky _ tms sensitivity to 
the redshift evolution of the equation of state of dark energy is comparable to the best methods proposed for the coming decade. Moreover, 
external information on dark energy parameters such as provided by CMB and Type la supernova measurements can significantly improve 
the parameter accuracies as shown in Figures 8 and 9. In addition, lensing tomography can precisely probe the mass power spectrum: the 
constraint on the power spectrum amplitude is a(crg) ~4x 10~ 3 , and on the running spectral a(dn/d\nk) ~4x 10~ 3 / sk y /2 - Our 
analysis includes the full information at the two- and three-point level. Using three-point statistics such as the skewness, which contain no 
information on triangle configurations, weakens parameter constraints significantly. 

Thus there is strong motivation to build an accurate model of lensing observables in the moderately non-linear regime. This should be 
feasible, since the physics involved in lensing is only gravity. It will also be necessary to calibrate the covariance of the bispectrum over 
relevant triangle configurations using a sufficient number of simulation realizations. Such an accurate model of the lensing bispectrum might 
modify the ellipse shapes in Figure 6. Even so, we believe that the level of improvement from bispectrum tomography is likely to be correct, 
because the mass bispectrum we have employed should correctly estimate the amplitude, and therefore the signal-to-noise to within 10-20%. 

We also estimate how bispectrum tomography can put constraints on primordial non-Gaussianity. The most optimal constraint was 
estimated by requiring that the bispectrum due to primordial non-Gaussianity is detectable with S/N > 1. We neglected the bispectrum 
induced by gravitational non-Gaussianity, since it differs in its redshift evolution and configuration dependence. However, even with this 



2 Note that the definition of fco in the model (39) differs in these papers, which affects the 1-a error. 
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assumption, the result in Figure 4 is not very promising: the constraint on the primordial non-Gaussian model is not as stringent as that from 
the CMB bispectrum measurement (Komatsu et al. 2003), unless an almost all-sky lensing survey is available. It is worth exploring how the 
three-dimensional mass reconstruction proposed by Hu & Keeton (2003) could allow us to improve the lensing estimates. 

We have concentrated on statistical measures of the convergence. However, the convergence field is not a direct observable, and re- 
constructing the convergence field from the measured ellipticities (shear) of galaxies is still challenging from survey data. Reconstruction 
techniques have been proposed for the power spectrum and the convergence field from realistic data (see Kaiser 1998; Hu & White 2001 for 
the 2D case and Taylor 2003 and Hu & Keeton 2003 for 3D mass reconstruction). It is of interest to develop an optimal method of extracting 
the lensing bispectrum from the measured shear field. Alternatively, we can use measurements of the three-point correlation functions of the 
shear fields, which have been extensively studied recently by Schneider & Lombardi (2003), Zaldarriaga & Scoccimarro (2003) and TJ03a,c 
(see Bernardeau, Mellier & Van Waerbeke 2003; Pen et al. 2003; Jarvis, Bernstein & Jain 2003 for measurements). The shear three-point cor- 
relation functions carry full information on the convergence bispectrum (Schneider, Kilbinger & Lombardi 2003). Therefore, all the results 
derived in this paper are attainable using tomography of the two- and three-point functions of the shear fields. 

There are other uncertainties we have ignored in this paper. We have assumed accurate photometric redshift measurements. For finite 
errors in the redshifts, we cannot take narrow redshift subdivisions of the galaxy distribution. This error would lead to additional statistical 
errors on measurements of the power spectrum and bispectrum and in turn on the cosmological parameters. Since we use only two or three 
redshift bins in our analysis, the demands on statistical errors are not very stringent. However, possible biases in the photometric redshifts 
must be carefully examined, because the redshift evolution of dark energy leads to only a small effect on the lensing observables, as discussed 
in Bernstein & Jain (2003). 

We have also ignored the B-mode contamination to the lensing observables, though it is seen in current measurements (e.g., Jarvis et 
al. 2003). The main source is likely to be observational systematics which are not eliminated in the PSF correction. For a shallow survey, the 
intrinsic ellipticity alignments could also provide significant contribution to the B-mode on small angular scales. Another useful application 
of photometric redshift information is that it allows us to remove intrinsic alignment contaminations by excluding close pairs of galaxies 
in the same redshift bin (Heymans & Heavens 2003; King & Schneider 2003). Thus the cross-power spectrum calculated from different 
redshift bins is not affected by the intrinsic alignment. Hence, lensing tomography that uses the cross-power spectra and bispectra is robust to 
systematics (Takada & White 2003). The degradation in parameter accuracies is likely to be small, because the cross-power spectrum carries 
comparable lensing signal to the auto-spectrum (see Figure 2 and Takada & White 2003). 

We thank G. Bernstein, D. Dolney, M. Jarvis, E. Komatsu, W. Hu, L. King, E. Linder, S. Majumdar, A. Refregier, R Schneider, R. 
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publicly available. We acknowledge the hospitality of the Aspen Center for Physics, where this work was begun. This work is supported by 
NASA grants NAG5-10923, NAG5-10924 and a Keck foundation grant. 
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APPENDIX A: APPROXIMATION FOR WIGNER-3 J EVALUATION 

The expression for the bispectrum of the convergence field involves the Wigner-3j symbol, which has a closed algebraic form (e.g., Hu 
2000): 
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for even L = h + h + h and zero for odd L. 

Rather than using the exact equation, we employ an approximation of the Wigner-3j symbol evaluation, since the direct numerical 
calculation encounters a divergence problem in the factorial as l\ for large I. For this purpose, we use the Stirling approximation: n! = 
T(n + 1) and 



V{x) ~ (2TY) 1/2 e- x x x - 1/2 , for large x. 



(A2) 



This approximation is quite good, leading to errors less than 0.2% for angular scales I > 50 of our interest. Using this approximation, the 
expression (Al) of the Wigner-3j symbol is rewritten as 
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